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EXTREMAL  QUANTILES  AND  VALUE-AT-RISK 

VICTOR  CHERNOZHUKOV  AND  SONGZI  DU 

Abstract.  This  article  looks  at  the  theory  and  empirics  of  extremal  quantiles  in  eco- 
nomics, in  particular  value-at-risk.  The  theory  of  extremes  has  gone  through  remarkable 
developments  and  produced  valuable  empirical  findings  in  the  last  20  years.  In  the  dis- 
cussion, we  put  a  particular  focus  on  conditional  extremal  quantile  models  and  methods, 
which  have  applications  in  many  areas  of  economic  analysis.  Examples  of  applications 
include  the  analysis  of  factors  of  high  risk  in  finance  and  risk  management,  the  analysis 
of  socio-economic  factors  that  contribute  to  extremely  low  infant  birthweights,  efficiency 
analysis  in  industrial  organization,  the  analysis  of  reservation  rules  in  economic  decisions, 
and  inference  in  structural  auction  models. 
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1.  Introduction 

1.1.  Some  Basics.  Let  a  real  random  variable  Y  have  a  continuous  distribution  function 
Fyd/)  =  Prob[Y  <  y]-  A  r-quantile  of  Y  is  the  number  Fy  (t)  such  that  Prob[Y  < 
Fy^{t)]  —  T  for  some  r  G  (0,1).^  The  quantile  function  Fy  (t),  viewed  as  a  function  of 
probability  index  r,  is  the  inverse  of  the  distribution  function  Fy(j/).  The  quantile  function 
is  therefore  a  complete  description  of  the  distribution. 

Let  X  be  a  vector  of  regressor  variables.  Let  Fy-(y|x)  =  Prob[Y  <  y\x]  denote  the 
conditional  distribution  function  of  Y  given  A'  =  x.  The  conditional  r-quantile  of  a  random 
variable  Y  with  a  continuous  conditional  distribution  function  is  the  number  Fy'^{T\x)  such 
that  Prob[Y  <  Fy  (r|x)jA  =  x]  —  t.  The  conditional  quantile  function  Fy^{t\x)  viewed 
as  a  function  of  x  is  called  the  r-quantile  regression  function.  The  main  use  of  the  quantile 
regression  function  Fy  (r|a")  is  to  measure  the  effect  of  covariates  on  outcomes,  both  in  the 
center  and  in  the  upper  and  lower  tails  of  an  outcome  distribution.  To  this  effect,  a  quantile 
or  a  conditional  r-quantile  will  be  referred  to  as  extremal  whenever  the  probability  index  r 
is  either  low,  r  <  .15,  or  high,  r  >  .85.  Without  loss  of  generality,  we  focus  the  discussion 
on  the  low  quantiles. 

1.2.  Examples  as  a  Motivation.  There  are  many  applications  of  extremal  quantiles  in 
economics,  particularly  of  extremal  conditional  quantiles.  Here  we  give  a  sample  of  these 
applications  as  a  motivation  for  what  follows. 

Example  1.  Conditional  Value- at- Risk.  Value-at-risk  analysis  seeks  to  forecast 
or  explain  very  low  conditional  quantiles  Fy  ^(t|A)  of  an  institution's  portfoho  return,  F, 
tomorrow,  using  today's  available  information,  X  (Chernozhukov  and  Umantsev  2001). 
Typically,  extremal  quantiles  Fy  ^(rjA)  with  r  =  0.01  and  r  =  0.05  are  of  interest.  Value- 
at-risk  analysis  is  a  daily  activity  for  banking  and  other  financial  institutions,  as  required  by 
the  Securities  and  Exchange  Commission  and  the  Basle  Committee  on  Banking  Supervision. 
As  a  risk  measure,  value-at-risk  is  motivated  by  the  safety-first  decision  principle  formal- 
ized by  Roy  (1952),  in  which  one  makes  optimal  decisions  subject  to  the  constraint  that 
the  probability  of  the  risk  of  a  large  loss  is  kept  small.  This  and  similar  measures  are  com- 
monly used  in  real-life  financial  management,  insurance,  and  actuarial  science  (Embrechts, 
Kliippelberg,  and  Mikosch  1997). 

Example  2.  Determinants  of  Very  Low  Birthweights.  In  the  analysis  of  infant 
birthweights,  we  may  be  interested  in  how  smoking,  absence  of  prenatal  care,  and  other 
types  of  maternal  behavior  affect  various  birthweights  (Abrevaya  2001).  Of  special  interest, 
however,  are  the  very  low  quantiles,  since  low  birthweights  have  been  linked  to  subsequent 
health  problems.  Chernozhukov  (2006)  provides  an  empirical  study  of  extreme  birthweights. 


^More  generally,  let  Fy- '(r)  =  inf{i/  :  Fy(y)  >  t}. 
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Example  3.  Probabilistic  Production  Frontiers.  An  important  form  of  efficiency 
analj'sis  in  the  economics  industrial  organization  and  regulation  is  the  determination  of 
efficiency  or  production  frontiers  (Timmer  1971).  Given  cost  of  production  and  possibly 
other  factors,  X,  we  are  interested  in  the  highest  production  levels  that  only  a  small  fraction 
of  firms,  the  most  efficient  firms,  can  attain.  These  (nearly)  efficient  production  levels  can  be 
formally  described  by  extremal  quantile  regression  function,  Fy^{t\X)  for  t  G  [1  -  e,  1)  and 
e  >  0;  so  only  an  e-fraction  of  firms  produce  Fy7^(rjX)  or  more.  The  models  and  methods 
discussed  in  this  article  are  highly  pertinent  for  inference  on  the  probabilistic  frontiers. 

Example  4-  (S,s)-Rules  and  Other  Approximate  Reservation  Rules  in  Eco- 
nomic Decisions.  A  related  example  is  that  of  (5,  s)-adjustment  models,  which  arise  as 
optimal  policies  in  many  economic  models  (Arrow,  Harris,  and  Marschak  1951).  For  exam- 
ple, the  capital  stock  Z  is  adjusted  up  to  the  level  S  once  it  has  depreciated  to  some  low 
level  s.  In  terms  of  an  econometric  specification,  we  may  think  that  the  observed  capital 
stock  satisfies  the  equation  Z,;  =  s(X,)  + 1',,  where  Xi  are  covariates,  and  Vi  is  a  disturbance 
that  is  positive  most  of  the  time,  i.e.  Prob{vi  >  0)  is  close  to  1.  Once  the  capital  stock 
Zj  reaches  the  critical  level,  i.e.  Zi  <  s(A',:),  it  is  adjusted  in  the  next  period.  We  assume, 
as  in  Caballero  and  Engel  (1999),  that  when  the  disturbance  Vi  =  Z,  —  s(A'j)  is  negative, 
it  captures  unobserved  heterogeneity  and  small  decision  mistakes  that  are  independent  of 
observed  covariates  Xj.^  In  a  given  cross-section  or  time  series,  adjustment  will  occur  infre- 
quently, so  in  fact  data  at  or  below  the  lower  adjustment  band  s(A'i)  will  be  observed  with  a 
small  probability  Prob{v,,  <  0);  hence  F^HtIX)  =  s{X)  +  F'^t)  for  r  e  {0,Prob{v  <  0)]. 
The  lower-band  function  s{X)  therefore  coincides  with  the  lower  conditional  quantile  func- 
tion up  to  an  additive  constant.  A  similar  argument  works  for  the  upper-band  function 
S{X). 

Example  5.  Structural  Auction  Models.  In  the  standard  specification  of  the  first- 
price  procurement  auction  where  bidders  hold  independent  valuations,  the  winning  bid, 
Bi,  satisfies  the  equation  Bi  ~  c(Xj)/?(r?,,)  -f-  e,, Cj  >  0,  where  c(A'",;)  is  the  efficient  cost 
function  and  /3(n.()  >  1  is  a  mark-up  that  approaches  1  as  the  number  of  bidders,  n^, 
approaches  infinity  (Donald  and  Paarsch  2002).  By  construction,  c(A'j)/?(?ii)  is  the  extreme 
conditional  Cjuantile  function.  In  empirical  analysis,  it  is  realistic  to  let  the  disturbance  Cj 
take  some  small  negative  value  so  that,  when  negative,  these  disturbances  capture  small 
decision  mistakes  that  are  independent  of  included  explanatory  variables.  In  this  case 
the  quantile  function  satisfies  Fg\T\X,n)  =  C{X)P{n)  +  F~\t),  for  r  G  (0,  F[e  <  0]). 
Quantile  regression  methods  can  be  emploj^ed  to  make  inference  on  c{X)  and  f3{n). 


In  this  example,  Z,  could  be  any  monotone  transformation  of  the  stock  variable.  For  instance,  the  log 
transformation  gives  an  accelerated  failure  time  model  for  the  capital  stock.  Caballero  and  Engel  (1999) 
explore  such  specifications  for  empirical  (S,s)  models  in  detail. 
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1.3.  Organization  of  the  Article.  The  rest  of  the  article  is  organized  as  follows.  Section  2 
describes  the  basic  model  of  extremal  quantiles  and  extremal  conditional  quantiles.  Section 
3  describes  basic  estimation  theory  and  inference  theory.  Section  4  reviews  the  key  empirical 
applications  and  provides  an  illustrative  example.  Section  5  concludes. 

2.  Basic  Models  of  Extremal  Quantiles 

2.1.  A  Basic  Model  of  Extremal  Quantiles.  Towards  discussing  inference  methods,  as- 
sume that  the  distribution  function  of  the  response  variable  Y  has  Pareto-type  tails,  which 
means  that  tails  behave  approximately  like  power  functions.  Such  tails  are  prevalent  in 
economic  data,  as  discovered  by  the  prominent  Italian  econonometrician  Vilfredo  Pareto  in 
1895.''  Pareto-type  tails  encompass  or  approximate  a  rich  variety  of  tail  behavior,  includ- 
ing that  of  thick-tailed  and  thin-tailed  distributions,  having  either  bounded  or  unbounded 
support,  and  their  mathematical  theory  in  connection  to  extreme  value  theory  has  been 
developed  by  Gnedenko  (1943)  and  de  Haan  (1970). 

Consider  a  random  variable  F  and  define  a  random  variable  U  as  U  =  Y ,  ii  the  lower 
end-point  of  the  support  of  Y  is  — oo,  and  U  =  1'  —  Fy  ^(0),  if  the  lower  end-point  of 
the  support  of  Y  is  Fy7^(0)  >  — oo.  The  distribution  function  of  U,  denoted  by  Fy  then 
has  lower  end-point  Fj^{0)  =  — oc>  or  Fj"'(0)  =  0.  The  assumption  that  the  distribution 
function  Fu  and  its  quantile  function  FJ^  exhibit  Pareto-type  behavior  in  the  tails  can  be 
formally  stated  as  the  following  two  equivalent  conditions: 

Fuiii)     -     L{u)-ir'^^    asu\Fj\0),  (2.1) 

Fy\T)     -     L(r)T--'     asr\0,  ■  '  (2.2) 

for  some  real  number  <^  7^  0,  where  L{u)  is  a  nonparametric,  slowly-varying  function  at 
F~^{0),  and  L{t)  is  a  nonparametric  slowly-varying  function  at  0.^  The  prime  examples  of 
slowly-varying  functions  are  the  constant  function  L{y)  —  L  and  tlie  logarithmic  function. 
The  number  <^  defined  in  (2.1)  and  (2.2)  is  called  the  extreme  value  (EV)  index. 

The  absolute  value  |^|  of  the  EV  index  ^  measures  heavy-tailedness  of  distributions. 
A  distribution  Fy  with  Pareto-tj^ae  tails  necessarily  has  a  finite  lower  support  point  if 
^  <  0  and  a  infinite  lower  support  point  if  ^  >  0.  Distributions  with  ^  >  0  include  stable 
distributions,  Pareto  distributions,  t  distributions,  and  many  others.  For  example,  the  t- 
distribution  with  u  degrees  of  freedom  has  the  EV  index  ^  —  Yjv  and  exhibits  a  wide  range 
of  tail  behavior.  In  particular,  setting  v  =  \  yields  the  Cauchy  distribution  which  has  heavy 
tails  (with  <^  =  1),  while  setting  i^  =  30  yields  approximately  the  normal  distribution  which 


^See  Pareto  (1964). 


The  notation  a  ~  6  means  that  ajh  — >  1  as  appropriate  hmits  are  tal;en. 

A  function  u  1-^  L(u)  is  said  to  be  slowly-varying  at  0  if  limits  [iO/iln^Ol  =  1  for  ^i^Y  "t-  >  0. 
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has  light  tails  (with  (,  —  1/30).  On  the  other  hand,  distributions  with  <,  <  0  include  the 
uniform,  exponential,  Weibull  distributions,  and  others. 

The  assumption  of  Pareto-type  tails  can  be  equivalently  cast  in  terms  of  the  regular 
variation  assumption,  as  is  commonly  done  in  extreme  value  theory.  Distribution  func- 
tion Fu  is  said  to  be  regularly  varying  at  FJ"  (0)  with  index  of  regular  variation  —1/4  if 
lim  \^-i,g>(Ft/(ym)/F[,i(y))  =  m~'^'^,  for  any  m  >  0.  This  condition  is  equivalent  to  the 
regular  variation  of  quantile  function  Fj^  at  0  with  index  -^■.  limT\Q{Fy^ {Tm) / Fy^ {t})  = 
771"'',  for  any  m  >  0. 

It  should  be  mentioned  that  the  case  of  (^  =  0  corresponds  to  the  class  of  rapidly  varying 
distribution  functions.  These  distributions  functions  have  exponentially  light  tails,  with  the 
normal  and  exponential  distributions  being  the  chief  examples.  To  simplify  exposition,  we 
do  not  discuss  this  case  explicitly.  However,  since  the  limit  distribution  of  main  statistics  are 
continuous  in  (^,  including  at  ^  =  0,  inference  theory  for  the  case  of  i^  =  0  can  be  adecjuately 
approximated  by  the  case  of  ^  «  0. 

2.2.  A  Basic  Model  of  Extremal  Conditional  Quantiles.  Consider  the  classical  linear 
functional  form  for  the  conditional  quantile  function  of  Y  given  X  =  x: 

Fy\T\x)  =  x'f3(T).       for  aU  r  e  I  =  (O,??],  some  r?  G  (0, 1],  (2.3) 

and  for  every  x  in  the  support  of  X.  This  linear  functional  form  is  flexible  in  the  sense 
that  it  has  good  approximation  properties.  Given  the  original  regressor  A'*,  the  final  set  of 
regressors  X  to  be  used  in  estimation  can  be  formed  as  a  vector  of  approximating  functions. 
For  example,  X  may  include  power  functions,  splines,  and  other  transformations  of  A'*. 
The  linear  functional  form  also  provides  computational  convenience. 

The  following  model  for  the  tails  and  its  generalizations  were  developed  in  Chernozhukov 
(2005).  The  main  assumption  is  that  the  response  variable  Y,  transformed  by  some  auxiliary 
regression  line,  has  regularly  varying  tails  with  EV  index  c^.  Indeed,  in  addition  to  (2.3), 
suppose  there  exists  an  auxiliary  parameter  /i^  such  that  the  disturbance  U  =  Y  —  X'P^  has 
conditional  end-point  0  or  — oo  a.s.  and  its  conditional  quantile  function  F^^(t|x)  satisfies 
the  following  tail-equivalence  relationship  as  r  \  0,  uniformly  for  x  in  the  support  of  X: 

F^\t\x)^F-\t\x)-x'i3,-^F~\t),  (2.4) 

where  F^^  is  a  quantile  function  such  that 

F-Hr)^L{T)T-^,  "    .  (2..5) 

where  L{t)  is  a  nonparametric  slowly-varjdng  function  at  0.  Equation  (2.5)  imposes  Pareto- 
type  behavior  on  the  conditional  law,  while  equation  (2.4)  requires  this  behavior  to  hold 
uniformly  across  conditioning  values.  Since  this  assumption  only  affects  the  tails,  it  allows 
covariates  to  impact  the  extremal  quantiles  and  the  central  quantiles  very  differently;  the 
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impact  of  covariates  on  extremal  quantiles  is  approximated  by  (3^,  which  could  differ  sharply 
from,  for  example,  the  impact  on  the  median  given  by  /3(l/2).  Chernozhidrav  (2005)  provides 
further  generalizations  of  this  model. 

3.  Basic  Estimation  Methods 

3.1.  Estimates  based  on  sample  quantiles.  Given  T  observations  {Yt,t  =  1,  ...,T},  the 
r-sample  quantile  can  be  obtained  by  solving  the  following  optimization  problem: 

r 
Fy' (r)  e  arg  min      V pr  [Yt  -  (3) ,  (3.6) 

where  Pt{u)  =  (r  —  \{u  <  0))u  is  the  asymmetric  absolute  deviation  function  of  Fox  and 
Rubin  (1964). 

Sample  quantiles  are  also  order  statistics,  and  we  will  refer  to  tT  as  the  oi^der  of  r- 
quantile.  The  sequence  of  quantile  index-sample  size  pairs  (r,  T)  will  be  said  to  be  an 
extreme  order  sequence  if  r  \  0  and  rT  -^  k  >  0,  an  intermediate  order  sequence  if  r  \  0 
and  tT  — >  oo,  and  a  central  orrfer  sequence  if  r  is  fixed  and  T  — >  oo.  Each  different  type  of 
the  sequence  leads  to  different  asymptotic  approximations  to  the  finite-sample  distributions 
of  sample  quantiles.  Extreme  order  sequences  lead  to  non-normal  (extreme- value)  distri- 
butions (EV)  that  approximate  the  finite  sample  distributions  of  extremal  (high  and  low) 
quantiles  much  better  than  the  normal  distributions  do.  In  particular,  EV  distributions 
work  much  better  than  the  normal  distribution  if  tT  <  30. 

3.1.1.  Extreme  Order  Quantiles.  Consider  an  extreme-order  seciuence.  The  following  is  the 
classical  result  on  the  limit  distribution  of  order  statistics:  for  any  integer  A:  >  1  and 
r  —  k/T,  as  T  —^  oo. 


/here 


AriFyHr)  -  Fy^r))  -,  T'^  -  fc-«,  (3.7) 


At^I/F^Hi/T),     Tfc  -  £■!  +  ...  + 4,  (3.. 


and  (£'i,£^2!  ■■■)  is  an  independent  and  identically  distributed  sequence  of  standard  exponen- 
tial variables. 

Result  (3.7)  was  obtained  by  Gnedenko  (1943)  under  the  assumption  that  Yi,Y2,--.  is 
a  sequence  of  independently  and  identically  distributed  (i.i.d.)  random  variables.  Result 
(3.7)  continues  to  hold  for  stationary  weakly-dependent  series,  provided  the  probability 
of  extreme  events  occurring  in  clusters  is  negligible  relative  to  the  probability  of  a  single 
extreme  event  (Meyer  1973).  The  results  have  been  generalized  to  more  general  time  series 
processes  (Leadbetter,  Lindgi'en,  and  Rootzen  1983). 
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Result  (3.7)  gives  an  extreme  value  (EV)  distribution  as  an  approximation  to  the  finite- 
sample  distribution  of  Fj7^(r).  The  EV  distribution  is  characterized  by  the  EV  index  i^, 
vifhich  can  be  estimated  by  one  of  the  methods  described  below.  Variables  F/j,  entering 
the  definition  of  the  EV  distribution,  are  known  as  gamma  random  variables.  The  limit 
distribution  of  the  k-th  order  statistic  is  therefore  a  transformation  of  a  gamma  variable. 
The  EV  distribution  is  not  symmetric  and  may  have  significant  (median)  bias.  The  EV 
distribution  has  finite  moments  if  (^  <  0  and  has  finite  moments  of  up  to  order  1/.J  if  ^  >  0. 

The  classical  result  is  not  feasible  for  purposes  of  inference  on  Fy^lr),  since  the  scahng 
constant  At  is  not  easily  estimable  consistently.  One  way  to  overcome  this  problem  is  to 
make  additional  strong  assumptions  in  order  to  estimate  At  consistently.  For  instance, 
suppose  that  FJ" ^(r)  ~  Lt~^ ,  then  one  can  estimate  ^  using  methods  described  below  and 
L  by  L  =  {Fy\2T)  -  F,Ti(r)))/(2-«  -  l)r-f). 

Another  way  to  overcome  the  aforementioned  infeasibility  is  to  consider  the  asymptotics 
of  self-normalized  extreme  order  quantiles,  as  in  Chernozhukov  (2006): 

Zt{t)  =  AT(Fy\r)  -  FyHr))  ->,  ^^ff  ~S'^ .  (3-9) 


r~?  —  r" 

•^  mk        '■  k 


where  for  m  >  1  such  that  mk  is  an  integer. 


^r=-^-T ^   , .  (3.10) 

Fy\viT)    ~    Fy\T) 

Here,  the  scaling  factor  At  is  feasible  in  that  it  is  completely  a  function  of  data.  The  limit 
distribution  only  depends  on  the  EV  index  ^,  and  its  quantiles  can  be  easily  calculated 
analytically  or  by  simulation. 


3.1.2.  Intermediate  Order  Quantiles.  Consider  next  an  intermediate  order  secjuence.  As 
r  \  0  and  tT  —>  oo,  under  further  regularity  conditions, 

Zt{t)  ^  AT{Fy\T)  -  Fy'ir))  -,  A^  (o,  ^,J\^,^  ,  (3.11) 

where  At  is  defined  as  in  (3.10).  This  result,  obtained  by  Dekkers  and  de  Haan  (1989), 
gives  a  normal  asymptotic  approximation  to  the  finite-sample  distribution  of  sample  cjuantile 
Fy^{t).  The  main  condition  for  application  of  this  distribution  is  that  tT  -^  oo.  In  finite 
samples,  we  may  interpret  this  as  requiring  that  tT  >  30  at  the  minimum. 

The  normal  approximation  (3.11)  is  convenient,  but  extreme  approximation  (3.9)  is  al- 
ways better,  because  it  does  not  fail  when  tT  -^  k  <  oo  and  it  coincides  with  the  normal 
approximation  (3.11)  once  k  is  large. 
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3.1.3.  Extremal  Bootstrap.  In  many  cases  it  is  convenient  to  implement  inference  using  the 
following  approach.  Consider  the  sample  of  i.i.d.  variables: 

{Yu...,Yt)-(^^,-J-^],  (3.12) 

where  {S\,...,8t)  is  an  i.i.d.  sequence  of  standard  exponential  variables.®  Variables  gener- 
ated in  this  way  have  quantile  function: 

■  Fy\r)^tMl^J^n^,  .  (3.13) 

Observe  that  Fy7 ^(r)  —  1/.^  ~  t~^/^,  so  condition  (2.1)  is  satisfied.  We  propose  to  estimate 
the  finite-sample  distributions  of  Zt{t)  —  AriFy^ {t)  —  Fy^{t))  by  the  finite-sample  dis- 
tribution of  Zt{t)  for  the  case  when  the  data  follow  (3.27).  In  this  way,  we  reproduce  both 
the  EV  limit  (3.9)  and  the  normal  limit  (3.11)  under  extreme  and  intermediate  sequences, 
and  also  guarantee  good  finite-sample  performance  for  the  case  when  (3.27)  holds  exactly. 
The  simulation  can  be  done  using  the  following  algorithm: 

1.  For  each  i  <  B,  draw  {Yi,...,Yt)  as  i.i.d.   according  to  (3.27),  replacing  ^  with  a 
suitable  estimate  ^.  Compute  the  statistic  ZT^iir)  =  AriFy'^ {t)  —  Fy^{t)). 

2.  Use  quantiles  of  the  simulated  sample  {ZT,i{T),i  <  B)  for  inference  purposes. 

This  scheme  could  be  used  to  estimate  distributions  of  other  statistics,  including  estimators 
of  the  EV  index  and  extrapolation  estimators. 

Another  method,  developed  in  Chernozhukov  (2006),  is  based  on  subsampling  the  self- 
normalized  quantile  statistic.  This  method  is  less  accurate  than  the  extremal  bootstrap. 
However,  it  applies  under  more  general  conditions.  It  should  be  noted,  that  the  canonical 
(nonparametric)  bootstrap  does  not  work  in  these  settings  (Bickel  and  R'eedman  1981). 

3.1.4.  Confidence  Intervals  for  Fy  [t)  and  Bias  Correction  for  Fy  (r).  Let  the  a-quantile 
of  Zt{t)  be  denoted  by  c{a).  The  estimates  of  c{a)  can  be  obtained  using  either  EV 
approximation,  normal  approximation,  or  the  extremal  bootstrap,  also  having  replaced  <^ 
with  a  suitable  estimate.  Denote  the  resulting  estimates  by  c{a.).  Then,  the  median  bias- 
corrected  estimate  and  a%-confidence  region  for  Fy'^{T)  can  be  constructed  as 


.,.,.)  -  3^   and 


P,.M-31^.f.-(.)-^ 


(3.14) 


Yj,  defined  in  this  way,  follows  generalized  extreme  value  distribution,  which  nests  the  Frechet,  Weibull, 
and  Gumbell  distributions.  There  are  other  possibilities,  for  example,  (fi, ...,  ifr)  in  (3.27)  can  be  replaced 
by  uniform  variables  {Ui,  ...,Ut)  ,  in  which  case  Yt  follows  the  generalized  Pareto  distribution. 
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3.1.5.   EstimMors  of  tli.e  EV  Index  ^.  There  are  two  principal  estimators.  The  first  estima- 
tor, due  to  Pickands  (1975),  relies  on  the  ratio  of  sample  quantile  spacings: 


^  =  -\n 


In  2,  (3.15) 


Fy\2T)-Fy'{T)) 

such  that  T  — >  0  and  tT  ^>  oo  as  T  — >  oo.  Under  further  regularity  conditions, 


Another  estimator,  developed  by  Hill  (1975),  is  a  moments  estimator:^ 

,-,£LmV^,-W)-  ,3,,, 

such  that  r  ^  0  and  tT  — *  oo  as  T  — >  oo.  This  estimator  is  applicable  only  for  the  case  of 
^  >  0.  The  estimator  can  be  motivated  by  a  maximum  likelihood  method  that  fits  an  exact 
power  law  to  the  tail  data.  Under  further  regularity  conditions, 

v^{i-o^dj^{o,e)  (3.18) 

The  methods  for  choosing  r  are  described  in  Embrechts,  Kliippelberg,  and  Mikosch 
(1997).  The  variance  of  estimators  decreases  as  r  increases,  but  the  bias  (relative  to  the 
true  £,)  goes  up.  Another  view  on  the  choice  of  r  is  the  following:  statistical  models  are 
approximations,  not  literal  descriptions  of  the  data.  In  practice,  dependence  of  £^  on  the 
threshold  r  reflects  that  power  laws  with  different  values  of  ^  fit  better  different  tail  regions. 
Therefore,  if  the  interest  lies  in  making  inference  on  Fy^{t)  for  a  particular  r,  it  seems  rea- 
sonable to  use  ^  constructed  using  the  same  r  or  most  similar  r'  subject  to  the  condition 
that  t'T  >  30.^ 

The  limit  results  above  can  be  used  for  the  construction  of  confidence  regions.  As  an 
alternative,  we  can  apply  extremal  bootstrap  to  statistic  Zt  =  \/tT(^  —  cj)  to  estimate 
the  quantiles  of  Zj--  Given  the  estimated  a-cjuantiles  c{a),  we  can  construct  the  median 
bias-corrected  estimate  and  Q%-confidence  regions  for  ^. 

3.1.6.  Extrapolation  Estimators.  When  very  extreme  ciuantiles  cannot  be  estimated  pre- 
cisely, the  following  strategy  is  sensible:  estimate  less  extreme  ciuantiles  reliably,  and  then 
extrapolate  these  estimates  using  the  assumptions  on  tail  behavior  stated  earlier.  Dekkers 
and  de  Haan  (1989)  developed  the  following  extrapolation  estimator: 

F-\r,)  =^ldl^l^\F-'C2r)  -  Fy\T)\  +  Fy^r),  (3.19) 


Notation  (i)-  means  (i)_  =  —x  if  x  <  0  and  [x)-  =  0  if  x  >  0. 

o 

The  latter  condition  requires  that  a  sufficient  sample  be  available  to  estimate  ^. 
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where  Te  <^  t.  Another  useful  estimator,  which  is  vahd  only  for  the  case  of  ^  >  0,  is  the 
following: 

FyHre)  =  ire/Tr^-Fy\T),  (3.20) 

where  Tg  <C  r.  The  above  estimators  have  good  properties  provided  the  quantities  on  the 
right-hand  side  of  (3.19)  and  (3.20)  are  well  estimated,  which  requires  that  tT  be  large, 
and  that  the  tail  model  be  a  good  approximation  of  the  underlying  true  tail. 

3.2.  Estimates  based  on  sample  regression  quantiles.  Given  T  observations  {Yt,Xt,t  — 
1,  ■■■,T},  the  quantile  regression  estimate  of  Fy7^(r|a:)  is  given  by: 

Fy\T\x)=x'f3{T),     /?(r)  =.  arg  min      T  pr  {Yt  ~  X[p)  ,  (3.21) 

where  Pt(u)  =  (t  -  l{u  <  0))u.  Quantile  regxession  was  introduced  by  Laplace  (1818)  for 
the  median  case.  Koenker  and  Bassett  (1978)  extended  this  formulation  to  other  quantiles. 

3.2.1.  Extreme  Order  Asymptotics.  Chernozhukov  (2005)  derives  asymptotic  distributions 
of  regression  quantiles  under  extreme  order  sequences.  Consider  the  canonically-normalized 
QR  statistic 

Zrik)  =  At{iS{t)  -  /?(t)),  where  At  =  l/F-\l/T),  (3.22) 

and  the  self-normalized  QR  statistic 


ZT{k)=AT{l3{T)-i3{T)),  where  >^T 


-,  (3.23) 

X'iPimr)  -  P{t)) 

where  TT{m.  -  1)  >  d.  The  first  statistic  uses  an  infeasible  canonical  normalization,  while 
the  second  statistic  uses  a  feasible  normalization.  Then  as  rT  — >  A;  >  0  and  T  ^  oo 


ZT{T)^dZ^\k)-k~^, 


(3.24) 


Z<xi{k)  =   argmin 
Zoo{k)  ~  —argmin 


kE[Xrz  +  J2\xlz-T-' 

~kE[xrz  +  Y^\xlz  +  r;' 


i=l 


{^  <  0) 


(e>o) 


where  {ri,r2,...}   :=   {^i,!?!  +£'2,...}  and  {81,82, ■■■}  is  an  iid  sequence  of  exponential 
variables  that  is  independent  of  {Xi,  X-y, ...}.  Further,  for  any  m  such  that  k{m  —  I)  >  d, 

VkZ^{k) 


Zt{t) 


(3.25) 


E[X]'{ZUr,zk)-Z^\k)y 
The  results  hold  under  the  assmnption  that  the  data  come  from  either  an  i.i.d.  sequence 
or  a  stationary  weakly-dependent  sequence  with  extreme  events  satisfying  a  non-clustering 
condition. 
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Related  results  for  canonically  normalized  statistics  for  the  case  where  tT  — '  0  as  T  — >  oo 
have  been  obtained  by  Knight  (2001)  and  Portuoy  and  Jureckova  (1999). 

3.2.2.  Intermediate  Order'  Asymptotics.  Chernozhukov  (2005)  shows  that  under  intermedi- 
ate order  sequences,  as  r  \  0  and  tT  ^>  oo, 

ZT{r)=AT{p{T)-P{T))-^,Af(^0,[E(XX')l'j--^l^-^y  (3.26) 

where  At  is  defined  as  in  (3.23).  Like  the  result  under  extreme  order  sequences,  this  result 
holds  under  the  assumption  that  the  data  come  either  from  an  i.i.d.  sequence  or  from 
a  stationary  weakly-dependent  sequence  with  extreme  events  satisfying  a  non-clustering 
condition. 

3.2.3.  Extremal  Bootstrap.  In  practice,  it  is  convenient  to  implement  inference  by  construct- 
ing a  bootstrap  model  that  approximates  the  tail  features  of  the  true  conditional  quantile 
model  under  the  assumptions  of  Section  2.2;  then  using  this  model  to  simulate  the  distri- 
butions of  estimators  of  extreme  quantiles  and  tail  parameters. 

Consider  the  sample 

{{Y,,X,),...,iYr,Xr))  ^  ('f^^,X,  V  ....  {^l^^X^  ,  (3.27) 

where  (i?i,  ...,£^r)  is  an  i.i.d.  sequence  of  standard  exponential  variables  and  (Xi,...,X7')  is 
fixed  set  of  observations  on  regi'essors  that  we  have.  Variable  Yt  generated  in  this  way  has 
the  conditional  quantile  function 


,_i,    ,,,.       ,.,..   .        hind 

■Yt    \-  '---'        '-<.-v-  '  _^ 


Fy^\T\X^)  =  X[(3{t) 

where,(r).(bMl^llI:i=-I,0,...,0)\  ^^''^ 

Observe  that  Fy  [T\Xi)  —  1/^  ~  t~^/^,  so  the  model  satisfies  conditions  (2.4)  and  (2.5), 
as  does  the  true  conditional  quantile  model  under  our  assumptions.  Hence  we  can  estimate 
the  finite-sample  distributions  of  Zt{t)  —  Aj-{p{r)—P{T))  by  the  finite-sample  distribution 
of  Zt{t)  in  the  case  when  data  follows  (3.27).  In  this  simple  way,  we  can  replicate  both 
the  EV  approximation  (3.26)  and  the  normal  approximation  (3.24)  and  also  guarantee  good 
finite-sample  performance  for  the  case  when  the  model  (3.27)  holds.  The  simulation  can  be 
done  using  the  following  algorithm: 

1.  For  each  i  <  B,  draw  data  according  to  (3.27),  replacing  ^  with  a  suitable  estimate 
^.  Compute  the  statistic  Zt,i{t)  =  At{P{t)  —  P{t)). 

2.  Use  the  empirical  distribution  of  the  simulated  sample  (Zy  j(r),  i  <  B)  for  inference. 
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This  method  can  also  be  used  to  estimate  distributions  of  estimators  of  the  EV  index  and 
extrapolation  estimators  described  below. 

As  mentioned  before,  there  is  another  inference  method  proposed  by  Chernozhukov  (2006) 
which  uses  subsampling  to  estimate  the  distribution  of  self-normalized  statistic  Zt{t).  This 
method  is  less  accurate  than  the  extremal  bootstrap,  but  it  applies  under  more  general 
conditions. 


3.2.4.  Confidence  Intervals  and  Bias  Corrected  Estimates.  Suppose  we  are  interested  in  the 
parameter  tp' (3{t)  for  some  non-zero  vector  ■)/'•  Let  the  a-quantile  of  4'' Zt{t)  be  denoted  by 
c{a).  Having  replaced  ^  with  a  suitable  estimate,  the  estimates  of  c(q)  can  be  obtained  using 
either  EV  approximation,  normal  approximation,  or  the  extremal  bootstrap.  Denote  the 
resulting  estimates  by  cla).  The  median-bias  corrected  estimator  and  the  Q;%-confidence 
interval  for  4''(3{t)  can  be  constructed  as 


^'/3(7 


c(l/2) 
At 


and 


i'Pir) 


g(«/2) 
At 


A>'P{t) 


cjl  -  a/2) 
At 


(3.29) 


3. 2. .5.   Estimators  of  the  EV  Index  ^.  The  following  estimators  are  regression  analogs  of  the 
Pickands  and  Hill  estimators.  The  first  estimator  takes  the  form 


^^-In 


Fy'MX)-Fy\2r)\X) 


Fy\2T\X) 


Fy\T)\X) 


In  2, 


where  X  is  the  average  value  of  A'j.  Under  additional  regularity  conditions,  as  r 
tT  -^  DO  ^ 


(2(2f-l)ln2)2^ 
The  second  estimator,  which  is  applicable  when  ^  >  0,  takes  the  form: 


i  = 


Y:UHyt/Fy\r\Xt))- 

Tt 


Under  additional  regularity  conditions,  as  r  \  0  and  tT  — >  oo 

v^(e-o-d^v-(o.s^'). 


(3.30) 
0  and 

(3.31) 

(3.32) 
(3.3.3) 


The  limit  results  above  can  be  used  for  the  construction  of  confidence  regions.  An  alter- 


native approach  is  to  apply  extremal  bootstrap  to  statistic  Zt 


'tT{^ 


to  estimate 


the  quantiles  of  this  statistic.  Then  we  can  use  estimated  a-quantiles  c{a)  for  constructing 
the  median  bias-corrected  estimate  and  a%-confidence  regions  for  ^: 


e 


?(l/2)         , 
^-^ — -  a.nd 

?_ 

?(l-Q./2)     ^ 

c{a/2) 

^/^ 

TTT 

v^ 

(3.34) 
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3.2.6.  Extrapolation  Estimators.  By  analogy  with  the  unconditional  case,  the  extrapolation 
estimators  for  Fy   (Te|x),  where  r^  is  a  very  low  value,  can  be  constructed  as 

Fy\Te\x)  =  ^^'^^l  ^  ~  ^  \Fy\mT\x)  -  Fy'{T\x))]  +  FyHrlx),  (3.35) 

Fy\Te\x)  =  {Te/T)-^Fy'{T\x)       (^  >  0),  (3.36) 

where  Tg  <Si  r.  Note  that  the  estimator  (3.36)  is  valid  only  in  the  case  ^  >  0.  The  comments 
given  for  the  unconditional  case  apply  here  as  well.  Also,  we  can  construct  confidence  regions 
for  Fy  ^(Te|x)  based  on  extrapolation  estimators.  This  can  be  done  by  applying  the  extremal 
bootstrap  to  statistic  Zt  =  ATiFy\Te\x)-Fy\Te\x))  where  At  =  s/rf / X' (p{2T)-d{T)). 


4.  Empirical  Applications:  an  Overview  and  an  Illustration 

4.1.  A  Simple  Overview.  The  following  review  is  not  exhaustive  by  any  means;  it  aims 
to  provide  only  a  few  quintessential  references. 

4.1.1.  Extremal  Unconditional  Quantiles.  As  mentioned  in  Section  2,  Pareto  analyzed  in- 
come and  wealth  data  in  1895  and  suggested  that  power  laws  accurately  describe  the  tail 
data.  Pareto's  discovery,  although  remarkably  simple,  had  a  profound  elTect  on  both  em- 
pirics and  the  theory  of  extremes.  Zipf  (1949),  Mandelbrot  (1963),  Fama  (1965),  Praetz 
(1972),  Sen  (1973),  Jansen  and  de  Vries  (1991),  Longin  (1996),  among  others,  gave  fur- 
ther empirical  evidence  on  the  nature  and  prevalence  of  Pareto-type  laws  in  economic  data, 
including  city  sizes,  incomes,  and  financial  returns. 

It  should  be  mentioned  that  many  of  the  early  studies  were  highly  informal  in  nature. 
The  theoretical  work  in  extreme  value  theory  has  opened  paths  for  better  analysis.  Pi-om 
this  aspect,  the  study  of  Jansen  and  de  Vries  (1991)  can  be  singled  out  as  it  gave,  to 
our  knowledge,  the  first  highly  rigorous  analysis  of  the  tail  properties  of  financial  returns. 
Jansen  and  de  Vries  (1991)  estimate  the  EV  indices  for  various  primary  U.S.  stocks  to  be 
between  ,^  =  1/5  and  ^  =  1/3.  Using  quantile  extrapolation  estimators  to  estimate  value- 
at-risk,  they  also  conclude  that  the  1987  market  crash  was  not  an  outlier.  Rather  it  was  a 
rare  event,  the  magnitude  of  which  could  have  been  predicted  using  prior  data.  This  study 
stimulated  numerous  other  studies  that  rigorously  document  the  tail  properties  of  economic 
data  (Embrechts,  Kluppelberg,  and  Mikosch  1997). 

4.1.2.  Extremal  Conditional  QuantUes.  There  has  been  considerably  less  work  on  condi- 
tional methods.  However,  following  recent  theoretical  advances  we  expect  that  this  area 
will  see  active  development  in  the  near  future.  In  what  follows,  we  merely  highlight  some 
of  the  topics  and  directions. 
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In  what  might  be  the  earhest  example  of  conditional  quantile  analysis,  Quetelet  (1871) 
fitted  various  conditional  quantile  curves  to  age-height  data.  Remarkably,  Quetelet's  work 
included  tabulations  of  very  high  and  very  low  quantiles  of  heights  as  a  function  of  age. 
There  is  a  great  potential  for  the  applications  of  extremal  quantile  regression  methods  in 
similar  problems.  In  a  recent  study,  Chernozhukov  (2006)  estimates  the  impact  of  smoking 
and  maternal  behavior  on  extremely  low  birthweights  in  the  U.S.,  focusing  on  black  mothers. 
He  finds  that  the  impact  of  these  variables  on  birthweights  in  the  ranges  between  2.50  and 
1500  grams  sharply  differs  from  their  impact  on  the  central  birthweights.  For  instance, 
smoking  is  not  correlated  with  extremal  birthweights,  while  quality  of  prenatal  medical 
care  is  strongly  linked  to  extremal  birthweights. 

Aigner  and  Chu  (1968),  Timmer  (1971),  and  Aigner,  Amemiya,  and  Poirier  (1976)  pio- 
neered a  large  empirical  literature  on  production  frontiers.  A  major  problem  of  the  sub- 
sequent empirical  hterature  has  been  the  lack  of  statistical  methods  for  construction  of 
rehable  estimates  and  confidence  regions.  The  new  methods  discussed  in  Section  3  solve 
the  problem,  and  should  improve  the  rigor  of  the  empirical  work  in  this  area. 

There  is  a  considerable  appeal  for  the  use  of  extremal  conditional  cjuantile  methods  in 
auction  models.  An  important  study  that  illustrates  the  potential  is  by  Donald  and  Paarsch 
(2002),  who  analyze  an  empirical  structural  auction  model.  They  estimate  the  conditional 
support  function  of  bids  using  extreme  order  statistics  for  each  covariate  cell,  then  project 
the  estimated  function  onto  a  lower  dimensional  structural  function  implied  by  the  model 
via  a  minimum-distance  method.  A  generalization  of  this  approach  is  to  employ  extremal 
quantile  regression  for  estimation  of  the  (approximate)  support  function  in  the  first  stage. ^ 

Value-at-risk  is  another  potentially  important  area  of  applications  of  the  extremal  quan- 
tile regression.  Chernozhukov  and  Umantsev  (2001)  apply  these  methods  to  the  problem  of 
forecasting  value-at-risk  of  a  major  U.S.  oil  company.  They  estimate  extremal  conditional 
quantiles,  using  both  ordinary  and  extrapolation  methods,  and  implement  confidence  re- 
gions, using  subsampling  methods  described  in  Chernozhukov  (2006).  The  section  below 
briefly  revisits  some  of  the  main  questions  of  this  study. 

4.2.  An  Illustrative  Example.  Here  we  consider  a  problem  of  forecasting  conditional 
value-at-risk.  We  revisit  some  of  the  questions  asked  in  Chernozhukov  and  Umantsev  (2001) 
with  an  improved  methodology.  To  implement  the  analysis,  we  use  algorithms  written  in 
R  language  that  rely  on  Koenker's  (2006)  quantreg  package  as  the  basic  platform.  The 
algorithms  as  well  as  the  data  set  can  be  downloaded  from  www.mit.edu/~vchern/EQR/. 
A  detailed  description  of  the  data  set  is  given  in  Chernozhukov  and  Umantsev  (2001). 


The  use  of  near-extreme  quantile  regression  for  estimation  of  approximate  support  functions  allows  the 
researcher  to  discard  some  outliers  that  do  not  conform  the  model,  in  the  spirit  of  the  discussion  given  in 
Section  1. 
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We  estimate  the  following  conditional  quantile  function  for  various  low  values  of  r: 

Fy^\T\Xt)  =  X[(3{t)  =  /?o(r)  +  /3i(t)Xu  +  /?2(r)X,,2  +  [h{T)Xt^z.  (4.37) 

where  Yt  is  the  daily  (log)  return  on  the  stock  of  Occidental  Petroleum,  Xt^\  is  the  lagged 
return  on  spot  oil  price,  Xt,2  —  Yt-i  is  the  lagged  own  return,  and  Xt^z  is  the  lagged  return 
on  the  Dow  Jones  Industrial  Index.  There  are  2527  observations  in  the  sample. 

We  first  estimate  and  plot  the  function  (r,  i)  i-^  X[(3{t)  in  (r,  t)  space,  with  r  <  .2.5.  The 
graph  of  this  function,  shown  in  Figure  1 ,  gives  a  good  picture  of  the  evolution  of  risk  over 
time,  indicating  dates  where  the  predicted  risk  is  especially  high.  Let  us  next  determine 
what  causes  these  risk  fluctuations. 


0.25 


Figure  1.  The  fit  X[i3{t)  as  a  function  of  time  t  and 


Table  1  reports  the  estimates  of  the  coefficients  of  the  model  (4.37).  The  table  also 
reports  median  bias-corrected  estimates  and  90%  confidence  regions,  which  were  obtained 
using  the  extremal  bootstrap  approach  described  in  Section  3.  The  results  show  that  the 
primary  determinant  of  the  high  risk  levels  is  the  market.  The  further  in  the  tail  we  go.  the 
larger  is  the  magnitude  of  the  point  estimate  of  the  coefficient  on  the  DJI  return.  Moreover, 
the  confidence  region  for  this  coefficient  excludes  zero  even  at  r  =  .001. 
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Table  1.  Estimation  Results 
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Coefficients 

Estimate 

Bias-Corrected 

90%  Conf.  Region 

r  = 

.001 

Intercept 

-0.08 

-0.08 

[-0.11,-0.06] 

Lag  Return 

0.12 

0.01 

[-0.19,  0.63] 

Oil  Price 

-0.05 

-0.05 

[-0.42,  0.11] 

DJI  Return 

0.71 

0.73 

r  — 

.01 

[  0.05,  1.08] 

Intercept 

-0.05 

-0.05 

[-0.05,-0.04] 

Lag  Return 

-0.04 

-0.06 

[-0.16,  0.12] 

Oil  Price 

-0.05 

-0.06 

[-0.17,  0.02] 

DJI  Return 

0.49 

0.50 

r  = 

.05 

[  0.24,  0.65] 

Intercept 

-0.03 

-0.03 

[-0.03,-0.02] 

Lag  Return 

-0.03 

-0.04 

[-0.10,  0.04] 

Oil  Price 

0.01 

0.01 

[-0.04,  0.06] 

DJI  Return 

0.29 

0.30 

[0.17,  0.38] 

We  next  characterize  the  tail  properties  of  the  conditional  quantile  model.  Table  2  reports 
the  estimates  of  EV  index  ^  obtained  using  estimator  (3.32).  The  table  also  reports  median 
bias-corrected  estimates,  and  90%  confidence  regions,  which  were  obtained  using  the  nested 
approach  described  in  Sections  3.2.3  and  3.2.5.  The  bias-corrected  estimates  tend  to  be 
stable  with  respect  to  the  start  of  the  tail  determined  by  probability  index  r.  On  the  basis 
of  Table  2,  we  take  \  «  1/4  to  be  the  estimate  of  the  EV  index. 

Table  2.  Estimation  Results  for  EV  Index  <^ 

Estimate     Bias-Correct.  Estimate     90%  Conf.  Region 

[0.08,  0.34] 
[0.05,  0.25] 
[0.14,  0.30] 
[0.16,  0.27] 


r  =  .005 

0.24 

0.22 

T=  .01 

0.23 

0.17 

T  -  .025 

0.32 

0.24 

T  =  .05 

0.35 

0.23 

Having  characterized  the  EV  index,  we  can  now  estimate  very  extreme  quantiles  using 
extrapolation  methods.  We  set  the  risk  level  at  r  =  .0001,  so  that  the  return  falls  below 
Fy^  (.0001[Xf)  only  once  per  about  30  years,  a  very  rare,  extreme  event.  The  extrapolated 
estimates  of  .0001-quantile  are  obtained  using  equation  (3.36)  with  r  =  .05  and  ^  =  1/4. 
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The  resulting  extrapolation  fit  Fy^ {.000l\Xt)  sharply  differs  from  the  ordinary  fit  X[I3{t) 
obtained  by  quantile  regression.  The  reason  for  this  is  simple:  the  ordinary  fit  uses  sample 
data  that  likely  contains  no  observations  on  the  extreme  events  defined  above.  In  sharp 
contrast,  the  extrapolated  fit  uses  the  tail  model  and  a  reliably  estimated  conditional  .05- 
quantile  to  predict  the  magnitude  of  such  events.  The  quality  of  this  prediction  clearly 
depends  on  whether  the  tail  model  is  accurate. 
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Extrapolalated  vs  Ordinary  Estimates  of  Cond  Quantiles 
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Figure  2.  Extrapolated  and  Ordinary  Estimates  of  the  Conditional  .0001-Quantile. 

5.  Conclusion 

This  article  examines  the  theory  and  empirics  of  extremal  quantiles  in  economics.  The 
theory  of  extremes  provides  a  set  of  apphcable  methods  that  have  generated  numerous 
valuable  empirical  findings.  There  is  equally  promising  scope  for  the  use  of  the  extremal 
conditional  quantile  methods.  The  latter  methods  are  new  -  there  are  great  opportunities 
for  further  empirical  and  theoretical  developments. 


Refere.nces 

Abrevaya,   J.   (2001):     "The  effect  of  demographics  and  maternal  behavior  on  the  distribution  of  birth 
outcomes,"  Empirical  Economics,  26(1),  247-259. 


EXTREMAL  QUANTILES  AND  VALUE-AT-RISK  ■  17 

AlGNER,  D.  J.,  T.  Amemiya,  AND  D.   J.   PoiRiER  (1976):    "On  the  estimation  of  produclion  frontiers: 

maximum  likelihood  estimation  of  the  parameters  of  a  discontinuous  density  function,"  Internal.  Econom. 

Rev.,  17(2),  377-396. 
AlGNER,  D.  J.,  AND  S.  F.   Chu  (1968):    "On  Estimating  the  Industry  Production  Function,"   American 

Econom.ic  Review,  58,  826-839. 
Arrow,  K.,  T.  Harris,  and  J.  Marschak  (1951):    "Optimal  Inventory  Policy,"  Econornetrica,  19,  205- 

272. 
BiCKEL,  P.,  and  D.  Freedman  (1981):   "Some  Asymptotic  Theory  for  the  Bootstrap,"  Annats  of  Statistics, 

9,  1196-1217. 
Caballero,    R.,   and   E.   Engel  (1999):     "Explaining  Investment  Dynamics  in  U.S.  Manufacturing:    a 

Generalized  (S,s)  Approach,"  Econornetrica,  67(4),  783-826. 
Chernozhukov,  V.  (2005):   "Extremal  quantile  regression,"  Ann.  Statist.,  33(2),  806-839. 
(2006):   "Inference  for  Extremal  Conditional  Quantile  Models,  with  an  Application  to  Birthweights," 

MIT,  Department  of  Economics,  Working  Paper. 
Chernozhukov,  V.,  AND  L.  Umantsev  (2001):    "Conditional  Value-at-Risk;   Aspects  of  Modehng  and 

Estimation,"  Empirical  Econoraics,  26(1),  271-293. 
DE  Haan,  L.  (1970):    On  Regular  Variation  and  Its  Applications  to  the  Weak  Convergence.  Mathematical 

Centre  Tract  32,  Mathematical  Centre,  Amsterdam,  Holland. 
Dekkers,  a.,  and  L.  de  Haan  (1989):   "On  the  estimation  of  the  extreme- value  index  and  large  quantile 

estimation,"  Ann.  Statist,  17(4),  1795-1832. 
Donald,   S.   C,    and  H.   J.    Paarsch   (2002):     "Superconsistent  estimation  and  inference  in  structural 

econometric  models  using  extreme  order  statistics,"  J.  Econometrics,  109(2),  305-340. 
Embrechts,  p.,  C.  Kluppelberg,  and  T.  Mikosch  (1997):  Modelling  extremal  events,  vol.  33  of  Appli- 
cations of  Mathematics.  Springer- Verlag,  Berlin. 
Fama,  E.  F.  (1965):   "The  Behavior  of  Stock  Market  Prices,"  Journal  of  Business,  38,  34-105. 
Fox,  M.,  and  H.  Rubin  (1964):   "Admissibility  of  quantile  estimates  of  a  single  location  parameter,"  Ann. 

Math.  Statist,  35,  1019-1030. 
Gnedenko,  B.  (1943):   "Sur  la  distribution  limite  du  terme  d'  une  serie  aletoire,"  Ann.  m.ath.,  44,  423-453. 
Hill,  B.  M.  (1975):   "A  simple  general  approach  to  inference  about  the  tail  of  a  distribution,"  Ann.  Statist., 

3(5),  1163-1174. 
Jansen,  D.  W.,  and  C.  G.  de  Vries  (1991):   "On  the  Frequency  of  Large  Stock  Returns:  Putting  Booms 

and  Busts  into  Perspective,"  Review  of  Economics  and  Statistics,  73,  18-24. 
Knight,   K.   (2001):     "Limiting  distributions  of  linear  programming  estimators,"    Extremes,  4(2),  87-103 

(2002). 
KoENKER,  R.  (2006):  Quantreg:  Quantile  Regression.    R  package  version  3.90,  http://www.r-project.org. 
Koenker,  R.,  and  G.  S,  Bassett  (1978):   "Regression  Quantiles,"  Econometnca,  46,  33-50. 
Laplace,  P.-S.  (1818):   Theorie  analytique  des  probabilites.  Editions  Jacques  Gabay  (1995),  Paris. 
Leadbetter,  M.  R.,  G.  Lindgren,  and  H.  Rootzen  (1983):  Extremes  and  related  properties  of  random 

sequences  and  processes.  Springer- Verlag,  New  York-Berlin. 
LONGIN,   F.   M.   (1996):     "The  Asymptotic  Distribution  of  Extreme  Stock  Market  Returns,"    Journal  of 

Business,  69(3),  383-408. 
Mandelbrot,  M.  (1963):  "The  Variation  of  Certain  Speculative  Prices,"  Journal  of  Business,  36,  394-419. 
Meyer,  R.  M.  (1973):    "A  poisson-type  limit  theorem  for  mixing  sequences  of  dependent  "rare"  events," 

Ann.  Probability,  1,  480-483. 
Pareto,  v.  (1964):   Cours  d'economie  politique.  Droz,  Geneve. 

Pickands,  III,  J.  (1975):   "Statistical  inference  using  extreme  order  statistics,"  Ann.  Statist,  3,  119-131. 
Portnoy,   S.,  and  J.   Jureckova  (1999):    "On  extreme  regression  quantiles,"   Extremes,  2(3),  227-243 

(2000). 
Praetz,  V.  (1972):   "The  Distribution  of  Share  Price  Changes,"  Journal  of  Business,  45(1),  49-55. 
QuETELET,  a.  (1871):   Anthropometrie.  Muquardt:  Brussels. 

Roy,  a.  D.  (1952):   "Safety  First  and  the  Holding  of  Assets,"  Econornetrica,  20,  431-449. 
Sen,  a.  (1973):   On  Economic  Inequality.  Oxford  University  Press. 

Timmer,  C.   p.  (1971):    "Using  a  Probabilistic  Frontier  Production  Function  to  Measure  Technical  Effi- 
ciency," Journal  of  Political  Economy,  79,  776-794. 
ZiPF,  G.  (1949):  Human  Behavior  and  the  Principle  of  Last  Effort.  Cambridge,  MA:  Addison- Wesley. 


GOO^      3^1 


